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Abstract 

Processing of symbolic sequences represented by mapping of symbolic data into numerical signals 
is commonly used in various applications. It is a particularly popular approach in genomic and pro- 
teomic sequence analysis. Numerous mappings of symbolic sequences have been proposed for various 
applications. It is unclear however whether the processing of symbolic data provides an artifact of the 
numerical mapping or is an inherent property of the symbolic data. This issue has been long ignored 
in the engineering and scientific literature. It is possible that many of the results obtained in symboUc 
signal processing could be a byproduct of the mapping and might not shed any light on the underlying 
properties embedded in the data. Moreover, in many applications, conflicting conclusions may arise due 
to the choice of the mapping used for numerical representation of symbolic data. In this paper, we present 
a novel framework for the analysis of the equivalence of the mappings used for numerical representation 
of symbolic data. We present strong and weak equivalence properties and rely on signal correlation to 
characterize equivalent mappings. We derive theoretical results which establish conditions for consistency 
among numerical mappings of symbolic data. Furthermore, we introduce an abstract mapping model for 
symbolic sequences and extend the notion of equivalence to an algebraic framework. Finally, we illustrate 
our theoretical results by application to DNA sequence analysis. 
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I. Introduction 

Information is provided in many forms. At times, information is conveyed numerically. More often, 
information is represented in the form of symbols such as characters, tags, etc. For example, the areas of 
genomic and proteomic signal processing focus on sequences of nucleotides and amino acids, respectively 
[I]. The aim of symboUc signal analysis is to process symboUc data elements in order to extract useful 
information. 

In general, symbolic information is represented as a sequence of symbols (possibly of infinite length) 
{oij^Q^, where ai E A and A is a set of all possible symbols. For example, A could be a collection of 
the 26-lowercase English letters, i.e. A= {a, b,..., z}, or the four nucleotides in a genomic sequence, 
i.e. A = {A, T, G, C}. In statistical Uterature, symboUc data is usually called categorical data [2]. The 
use of Markov chain models and hidden Markov models has been examined for time-domain analysis 
of genomic and proteomic data [3], [4], [5], [6]. However, we often seek to rely on frequency-domain 
analysis methods of symbolic signals. Unfortunately, symbohc sets do not generally possess an algebraic 
structure that allows us to define mathematical operations (e.g. group, ring, or field). In traditional signal 
processing, the set A corresponds to real- or complex-valued numbers, i.e. A = 'M. or C, which form an 
algebraic field. However, attempts to define mathematical operations such as addition and multiplication 
on symbolic data has raised many questions about the meaning of the results obtained using such methods. 

Several techniques exist which incorporate numerical and symbohc processing in an effective way 
to develop symbohc analysis systems [7]. Software systems for symbohc computational algebra (e.g. 
Mathematica, Maple, etc.) represent a successful example of this approach. Such systems, however, are 
application-specific and difficult to realize for a broad class of symbolic signal processing applications. 
There are also various techniques for analyzing correlations, periodicities, etc. that do not require the aid of 
numerical symbol mappings. Among these techniques, the Mutual Information Function (MIF) [8] is one 
of the most important. The main advantage of these methods is that numerical mappings are not required. 
Moreover, it can be shown that methods such as MIF can capture any type of statistical dependence. The 
main disadvantages of these techniques, however, are that they generally provide less specific information 
than correlation analysis and they often suffer from a systematic overestimation of mutual information for 
finite sequences. Nevertheless, in order to extract the mathematical and statistical information embedded 
in symbohc sequences, we wish to employ the powerful analysis tools developed in traditional signal 
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processing, e.g. Fourier transform, correlation function, etc. We must therefore map the symbohc elements 
into numerical values. The resulting numerical sequence should preserve the information embedded in the 
symbolic data. Moreover, it should allow traditional signal processing techniques to extract the salient 
information about the symbolic sequences from the corresponding numerical signals. For instance, in 
DNA sequences, we have a finite alphabet associated with the four nucleotides in the genome, i.e. 
A = {A, T, G, C}. The mapping used for the representation of genomic data must preserve the inherent 
structure of DNA sequences. In particular, if we choose a mapping such as: Ah^ 1, Th^O, Gh^ —1, 
C I— > 0, we would not preserve uniqueness since T and C are mapped to the same value. 

Numerous mappings have been proposed for the numerical representation of DNA sequences. Buldyrev 
et al. [9] proposed various mapping rules for the representation of nucleotide sequences into one- 
dimensional numerical sequences based on the purine-pyrimidine (RY) rule, hydrogen-bond energy rule, 
etc. Li and Kaneko [10] and Voss [11] used the indicator sequence method, which essentially maps the 
symbol to a standard basis of the 4-dimensional Euclidean space M^. Berthelsen et al. [12] revised the 
method introduced in [9] by taking the molecular mass and hydrophobicity into account in representation 
of genomic data. Silverman and Linsker [13] rehed on the simplex method, which maps the symbol to 
the vertices of a regular simplex. Cristea and Anastassiou [14], [15] proposed the tetrahedral mapping, 
which maps the nucleotides into comers of a tetrahedron. Stoffer et al. [16] introduced a mapping 
whose aim is to accentuate the periodic features embedded in genomic sequences for stationary symbolic 
sequence analysis. Wang and Johnson [17] extended the method proposed by Stoffer et al. [16] for 
non-stationary sequence analysis. Rushid and Tuqan [18] proposed the Z-curve mapping, which is a 
unique 3-dimensional curve representation whose sequences are composed of binary values, i.e. 1 and 
— 1. They also proposed a matrix-based framework to combine many widely used mapping strategies in 
genomic sequence analysis [19]. Akhtar and Epps [20] proposed the Paired Numeric and Frequency of 
Nucleotide Occurrence methods for DNA symbolic-to-numeric representation and greatly improved the 
relative accuracy for gene and exon prediction. Asif and Datta [21] developed theoretical properties for 
the Binary Indicator Sequence method. Tuqan and Rushid [22] proposed a new DSP approach for finding 
codon bias based on Voss Indicator Sequence method. 

Each of the large number of numerical mappings used for the representation of genomic sequences 
can be justified for various apphcations. This raises several fundamental questions: What are the merits 
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of each mapping used for the analysis of DNA sequences? How can we compare the results obtained 
from different numerical mappings? Indeed, it is impossible to determine which mapping is preferable. 
Furthermore, it is conceivable that distinct mappings could lead to contradictory conclusions. In fact, 
several contradictory results have arisen in the field of genomic sequence analysis. Most notably, the 
study of long-range correlations in coding and non-coding DNA sequences has been contested by several 
contradictory results [11], [23], [24], [25]. Investigation using a large DNA sequence database did not 
resolve this dispute; in fact, the controversy grew even further [26]. Bouaynaya and Schonfeld [27], [28] 
shed light on this dilemma by demonstrating that a certain class of genomic sequences are inherently 
non-stationary and thus one of the reasons for the contradictory conclusions stems from the use of 
stationary time-series analysis tools. Moreover, they determined experimentally that the results obtained 
remained invariant over a large class of numerical mappings used for the representation of DNA sequences. 
Nonetheless, the experimental study conducted by Bouaynaya and Schonfeld in [27], [28] cannot be used 
to ascertain with certainty whether the different numerical mappings used for representation of genomic 
sequences contributed to the contradictory findings reported in the literature [11], [23], [24], [25]. 

To ensure a clear understanding of the imphcations of the different choices used for numerical 
representation of symbolic data, we must develop a fundamental new approach that can be used to 
characterize the fundamental properties of numerical mappings. Specifically, it is essential that we 
establish a mapping equivalence theory for symbolic data that can be used to guarantee consistency 
among a class of numerical representations. With the aid of a mapping equivalence theory we could 
determine whether different mappings should yield compatible results, i.e. whether the mappings used 
for the analysis of the same data lead to consistent conclusions. Moreover, the theory can indicate 
when distinct mappings could lead to contradictory results and thus comparison of the corresponding 
conclusions is futile. 

In this paper, we provide a mapping equivalence theory for the numerical representation of symbohc 
data undergoing transformation by an operator. We focus primarily on the mapping / : ,4 ^ which 
maps the symbols to the n-dimensional Euclidean space. In Section n, we first propose a framework for the 
analysis of different numerical mappings undergoing transformation by an analytic operator using Taylor's 
expansion. Moreover, we emphasize the investigation of first- and second-order operators including the 
correlation function and Fourier transform. These operators are widely used in signal processing and 
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analysis and thus play an important role in this presentation. In Section II-A, we provide an analysis of 
the correlation between different numerical mappings of a symbolic sequence. In particular, we derive 
conditions for strong equivalence captured by perfect correlation among distinct mappings. In Section 
II-B, we explore a relaxed similarity measure between distinct numerical mappings. Specifically, we 
provide conditions for weak equivalence which is characterized by preservation of the local extrema of 
the representation. In Section m, we introduce an abstract mapping model and extend the concept of 
equivalence to the generalized mapping model. In Section IV, we present experimental results which 
illustrate the significance of the proposed mapping equivalence theory in symbolic signal processing ap- 
phcations. In this presentation, the simulations are focused exclusively on analysis of genomic sequences. 
The results presented in this paper, however, are applicable for any symbolic signal modeled by a discrete 
alphabet with a finite cardinahty and independent of particular statistical properties such as stationarity, 
etc. Finally, we provide a brief summary and discussion of our results in Section V. 

II. Euclidean Mapping Equivalence for Symbolic Sequences 

Given {aj}^Q^, where Ui E A and \ A\ < oo, here | • | means the cardinality of the set. / is a mapping 
from A to M."', i.e. f : ai ^ Xi, xi G M". After the mapping we obtain a vector sequence {xi}f^. 
r : Xj T/i is a transformation from to W^. is an analytic operator on the numerical sequence 
and maps into M parameterized by Z G R. We also assume that G ^^(0- We classify the problems as 
in the following cases. 

1) Given T, determine the consistency between #;({a;i}^Q^) and ^i{{T{xi)}^S'f^). We need also 
figure out the largest class of operators which shows consistent results for two mappings under the 
given T. 

2) Given / and if / and T o f are consistent for any symbolic sequence {aj}^o^- ^^^^ out the 
largest class of such transformation T which preserves the consistency. Also figure out the largest 
class of transformation T preserving the consistency for given mapping /. 

The consistency here means we require the results under two different mappings to be similar in 
certain extent. In general may not be linear. We will use Taylor's expansion to expand the operator. 
We vectorize the vector sequence {xi}^^^ Xi G M" to a large vector x G M^"^^. Consider the 
Taylor's expansion of the analytic operator. : M^"^^ M. Without using the common scalar form 
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representation of Taylor's expansion [29], we shall present it in a concise form by using tensor product. 
First, we define the gradient operator V as 

V =( ^ -A. (1) 

V dXl dX2 ' ' ' dXNn J 

Then the Taylor's expansion of at xq can be represented as the following form, 

oo 

= ^(V'*^)(^0) Xi {X - Xo) Xi_i {X - Xo) Xi_2 • • • Xi (X - xo) (2) 
i=Q 

Where x , is the i^^ mode tensor product [30], and V* is the i*^ order gradient of which is defined 
as, 

V^*z = *i xi V X2 V X3--- Xi V (3) 

Furthermore, V°*/ is defined as For one- and second-order terms, it is easy to check that it 
coincides with the well-known definition of Gradient V*;(x) and Hessian V^*;(x). So we can 
rewrite the Taylor's expansion at xq as, 

=*i(xo) + V^l{xQf{x - Xq) + ^(X - XQfV'^^i{xQ){x - xo) 

+ V ^(V**/)(xo) Xj (x - Xo) Xi_i (x - Xo) Xj_2 • • • Xi (x - Xo) (4) 

1=3 

A metric or measure is needed for measuring the consistency. In general, there is no universal metric. 
Various operators may have different metrics for different purposes. In many cases, it is a reasonable 
principle to require the results of two different mappings to be similar in some extent. In hght of this 
principle, we propose the following two kinds of metrics. 

A. Strong Equivalence: Perfect Correlation 

We will use the correlation coefficient to characterize the consistency. First we provide the definition 
of the correlation coefficient p used in this paper. 

Definition 1: Given {ai\fj^, where G A, \A\ < oo. f : Qi ^ Xi, x G M", T : Xj i— ?/j is a 
transformation from M" to W^, is an operator on the numerical sequence. m{^i) = /j ^id/j, 
is the mean value of the in the space L of parameter I G M. is a measure on M. The correlation 
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coefficient is defined as 



(5) 



The use of abstract integration provides a unified framework for definition of the correlation coefficient. 
The measure /j, can be chosen to be any Borel measure such as the Lebesgue-Stieltjes or counting measures 
depending on the properties of the operator. In practice, the measures we rely upon are mainly the counting 
measure and the Lebesgue measure. 

It is well known that the correlation coefficient is between [—1,1] [31]. The correlation coefficient 
can be used as a measure to characterize the similarity of two different mappings. For a given T, if 
p = 1, then we say the transformation T is a strongly equivalent transformation of the map / for an 
operator and ^ii{T{xi)}^J^^) is a strong equivalence of ^i{{xi}^J^^). When the correlation coefficient 
is 1, it means the two mappings are the same up to a translation and scahng. This is the reason that 
it is called "strongly equivalent." Unfortunately, there is no the universal equivalent transformation for 
arbitrary operator. However, because of the importance of second-order statistics, we shall emphasize 
on the second-order operators such as the correlation function. From now on we will focus on the 
transformation T from M" to MP. For the case of mapping between Euclidean spaces with a different 
dimensions, we will present a detailed discussion in Section II-D. 

We first consider the correlation function. The correlation function of a sequence is defined as. 



Then if p = 1, we have the following theorem on the strongly equivalent transformation T. 

Theorem 1: For non-trivial operator and Unear transformation T, the correlation coefficient p = 1 if 
and only if the transformation T can be represented as T{xi) = XRxi, R is an orthogonal matrix and 




N-l 



(6) 



n=0 



A G M. 
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Proof: If T{xi) = ARxj, and R is orthogonal. Then 



n=0 



n=0 



= ArK{x,}iIo') 



(7) 



Conversely, if p = 1. and T(xj) = Axj. Then 



iV-l 



n=0 




(8) 



n=0 



where A € M and c € R is a constant. Since the equality holds for any sequence and any I. So A"'^ A = XI, 



Actually, this property not only holds for correlation operator, but also for a larger class of operators. 
Consider the Taylor's expansion of an operator We would Uke first to introduce the definition of 
bounded linear operator and Riesz representation theorem [32]. Then we will present a result for the 
first- and second-order bounded operators. 

Definition 2: Let {X, || • ||) be a normed space. A operator / : X ^ M is a bounded operator if / is 
hnear and there exists C > 0, such that |/| < C||a;||. 

The bounded operator can be thought as an analog of BIBO hnear system in signal processing theory, 
which illustrates the good behaved operators. Furthermore if the space X is a Hilbert space, we have the 
following theorem to characterize any linear bounded operator. 

Theorem 2: (Riesz Representation theorem for Hilbert space) X is a Hilbert space, then for any hnear 
bounded operator ^, there exists a unique y £ X, such that ^(x) =< x, y >. 

Note that with the usual dot product is a Hilbert space. Therefore, Riesz Representation theorem 
for Hilbert space holds for M". As before, we vectorize the vector sequence {xi}^^^ Xi G M" to a large 

June 11, 2009 DRAFT 



then A is orthogonal. 



vector X G R^""^^. Then any linear transformation T can be represented in the form 



(9) 



\ / NnxNn 

i.e. y = Tx. 

Then we have the following theorem for equivalent transformation of first- and second-order operators. 

Theorem 3: Any non-trivial bounded linear operator does not have any non-trivial (scaled identity 
mapping) linear strongly equivalent transformation. If rotation is a strongly equivalent transformation 
for a bounded operator whose Taylor's expansion does not have the third or higher-order terms, then its 
Taylor's expansion can not have first-order term and the Hessian V^*;(x) must have the form 



klll-nxn ki2lnxn 



21-'nxn 



22^nxn 



klNl-nxn 
k2Nlnxn 



\ 



(10) 



y kmlnxn kN2 Inxn ' ' ' kj^Nljixn J 

, where kij G M and kij = kji, Vi ^ j. 

For Fourier transform, in many situations, we focus exclusively on the modulus of the transform 
of symbolic data, i.e. we discard the phase information. Since the module of continuous-time Fourier 
transform is invariant under rotation, it is tempting to conclude that rotation is an equivalent transformation 
for the Fourier transform. However, the widely used form of the Fourier transform used in much of the 
literature devoted to DNA sequence analysis [17] is different from the classical multi-dimensional Fourier 
transform. Fortunately, we are able to show that rotation still yields an equivalent transformation. We 
first define the Fourier transform as: 

L = ^\\XL^\\l (11) 
where X is an x N matrix, whose i^^ column is Xi. Lp is the frequency vector, i.e. 



Lf = ( 



e N 



e N 



e " 



-2»-jm(iV-l) , 2^ 



(12) 
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If we vectorize X to x e M^^^^ as before, fm can also be represented as fm = ■^\\Ax\\2, where 

^ / -27r.im0 -27rjml -27r3m2 -27rjm(W-l) \ /-I OX 

^ - I e J e " i„xn e iv / ■■■ e n I I (1 J) 

V ^ nxAf 

notice that fm = -^{x^ Ax), which is a second-order operator and A^A is of the form, 



A^'A 



nxn 



(14) 



y WNWoInxn WNWiInxn " " " WnIVnIuxu J 



here Wi = e By theorem 3, rotation is a strongly equivalent transformation for Fourier transform. 



B. Weak Equivalence: Preservation of Local Extrema 

In the previous section, we employed the correlation coefficient as a metric to characterize the similarity 
for an operator under transformation. However, as we can see, the strong equivalence basically requires 
the result to be "exactly" the same. While in many situations, we do not focus on whether or not the 
result under two mapping strategies are exactly the same, i.e. the true numerical value of the result, but 
the relative relation or the relative trend of the result. For example, when we use the correlation function, 
in many cases, we only care where the peak and valley points are located and the changing trends, which 
are used to determine the periodicity structure of certain patterns. In these cases, what we really need is 
to preserve the local extremums and local trend under the a transformation. So we first give the definition 
of Local Minimum and Maximum Preserving Similarity or in this paper what we call weakly equivalent. 

Definition 3: Given {ai}^^, where G A, \A\ < oo. f : Ui Xi, x E R", T : Xi ^ yi 
is a transformation from to W^, is an operator on the numerical sequence. We say T is weakly 
equivalent, if for every x, which is a local minimal or maximals for ^x{{xi)'^^^) then ^x{{T{xi)}^^Q^) 
is also a local minimal or maximal respectively. 

A few easy observations and results follow. By definition strong equivalence imphes weak equivalence. 
Moreover, we have the following propositions to determine weak equivalence. 

Proposition 1: If is twice differentiable with respect to I, then T is weakly equivalent, if for any 
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I, where ^^'^^^^^-o ^ = O, the following conditions hold 

Af-l 



dl 

and 



dp ' dp - ^ ^ 



Proof: if Hs a local maximal or local minimal, then = and ^''^'^^^2^*'=°'^ < or 

d'^,i{xii^^o') > 0. By the definition of weak equivalence, (15) and (16) follow. ■ 
If i G Z, Then we have the following criterion to determine weak equivalence. 

Proposition 2: T is weakly equivalent for an operator where Z G Z, if for any I, the following 
condition holds 

{mXi}^=o') - ^l-l{{x^}l-o')) ■ {mT{x^)}l-o') - *,-l({r(x.)}iIo')) > (17) 



Proof: "Without loss of generality, we assume I is a local maximal for ^i{{xi}^Q^). then ^i{{xi}^Q^) > 
*i-i({a^O£o') and *K{^i}£o') > *i+i({^OiIo')- If (17) holds, we have 

M{T{xi)}fj,') > *,_i({T(x,)}iIoi) and *Km^O}.=o') > *;+i({r(x,)}£o')- Thus I is also a 
local maximal for */({r(xi)}^^o^). ■ 

As the importance of second-order statistics, specially we would hke to investigate the weakly equiv- 
alent transformation for the correlation function. We first introduce a lemma. 

Lemma 1: If the transformation TiM" — > R" is an inner-product preserving isometry, i.e. x^y = 
T{x)'^T{y), ^x,y G M", then T{x) = Hx, where R is an orthogonal matrix. Hence T is a bijective 
isometry. 

Proof: First let x = y, we have ||x||2 = ||T(x)||2, i.e. T preserves the Euclidean norm. Since r(a;o) 
is on the ball ||a;||2 = ||a;o||2> we have T{x) = Ii{x)x, where R(a;) is an orthogonal matrix function. Let 

R(x) = ( ui{x) U2{x) ... u{x)) (18) 
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, where {ui{x)} is orthonormal. Furthermore, let x = (1, 0, 0, ... , 0)-^ and y = (yi, 0, 0, . . . 0)-^. then we 
have 

x'^y = T{xfT{y) (19) 
yi = xll{x fK{y)y = yiuj{{l, 0, 0, ... , Of)ui{y) (20) 

Therefore ((1,0, 0, . . . ,0)^)ui{y) = 1. By Cauchy-Schwartz Inequality, we have 

1 = txf ((1,0,0, . . .,Of)uiiy) < ||ni((l,0,0, . . . ,0)^)||2 • \\ui{y)\\2 = 1 (21) 

The equality holds if and only if ui{y) = A'Ui((l, 0, 0, . . . , 0)^), but ||tti(?/)||2 = 1, thus A = 1. 
Therefore 

ui{y) = 'Ui((l, 0, 0, . . . ,0)^),Vy G M". By the same arguments we can show Ui{y) = Ui{ei), Vi = 
l,...n, where Cj is the standard basis of R". So R(x) = R is a constant orthogonal matrix, thus 
T{x) = Rx. This also shows T is a bijective isometry. ■ 

For correlation function, we have the following theorem showing that generally speaking, rotation can 
be thought as the "only" weakly equivalent transformation. 

Theorem 4: For a fix length sequence, any transformation which only brings small enough changes to 
the iimer product value under previous mapping will be a weakly equivalent transformation for correlation 
function. However, if the length goes to infinity, then rotation (or scaled rotation) is the only weakly 
equivalent transformation for correlation function. 

Proof: Consider the vector sequence {a;j}^Q^. The correlation function is 

I Af-l N _ 1 sr^N-l T, 



n{{xi}i=o ■) = N En=o ^ {n)x{n - I). Then we have 

ri - ri-i = -77 XI x^{i^){x{i^ - - ^C"' - ^ + 1)) (22) 



n=0 

After the transformation, we have the correlation function 
rl = n{T{{x,}l-')) = En=o T{x{n)rT{x{n - I)) and r[ - rl_, = En=o T{x{n)f {T{x{n - 
l)) — T{x{n — l + l))). By Proposition 2. T is weakly equivalent if r[ — r[_i has the same sign as ri — ri-i. 

Consider the alphabet A! = A\J{a — 6| a, 6 G A}, which means we consider the symbol "a — b" as a 
new symbol and we extend the mapping / on the newly added symbols as / : (a — 6) i-^ (/(a) — /(^))' 
which also extends the transformation T for /(a — h) be T{f{a)) — T{f{h)). Thus finding the weakly 
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equivalent transformation is same to find the T which preserve the sign at each I of cross correlation 
function Ri for the sequence {xjjj^Q^ and {xj — Xj+ij^Q^. 

Let = ( ^2 ... ua where = f{x)'^f{y), {x, y) or {y, x) G ^ x ^' and ^ = |^ x A'\. 
Also let Ni = { Ci C2 ... Ca where Cj is the counting number for the pair {x, y) corresponding 
to Oj which appears in the cross correlation function Ri. Therefore we have 

Ri = j^^'^Ni (23) 

Define T{J^) = { bi 62 ■■■ where bi = T{f{x))'^T{f{y)), where {x,y) corresponds to 

{x, y)in the Oj in JT. Notice that Ni will not change since it is determined by the given sequence. After 
the transformation, the cross correlation functions becomes i?| = j^T{!F)'^Ni. We need Ri and i?J have 
the same sign for all I. Notice that E^=i Ci = N. So every A^; for a given sequence corresponds to a 
point on the hyperplane J2t=i Xi = N in M"^. If T preservers signs for all /, then for each Ni, T{J^) 
should reside in the same half plane of J^. Because T{J^)'^Ni should have the same sign of J^'^Ni. In 
general, the sign will not all be positive or negative, since that will means the correlation function is 
monotonic which in general is not vahd for all sequences. Consider all possible symbol sequence of 
length iV and / G Z. Then 

TiJ") G fl {x\{x^Ni)-{J'^Ni)>0} (24) 

all sequences of length N 
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This implies T should reside in the intersection of all half plane determined by all sequences of length 
N. Each half plane is a convex cone, therefore the intersection is still a convex cone as illustrated in 
Fig. 1. Since we have finitely many point on the hyperplane Ylf=i^i — ^ ■ -^^l reside in the first 
quadrant. We can always construct two hyperplanes whose intersection is in one quadrant. Thus the 
intersection will be a small convex cone in a quadrant. If T{J^) is in that convex cone, then the sign is 
all preserved, which means T is a weakly equivalent transformation. This proves the first claim. 

However, if we let N go to infinity, first notice that the intersection will not be empty set, since !F is 
always in the intersection. But the points Ni become dense in the first quadrant. We can have a sequence 
of Ni such that N^J-' —>■ 0"*" and with N^J^ 0~. Therefore the intersection will be squeezed to the 



hne \J^, A > 0. i.e. The r(jr) = AJ^. If the scaling A = 1, tiien T{f{x)fT{f{y)) = f{xff{y), V/. 



Since the mapping is arbitrary, which means it should hold for any x and y. By lemma 1, T is a rotation 
or a scaled rotation. 

Conversely, by theorem 1, we have rotation or scaled rotation is a strongly equivalent transformation. 



From this theorem, we can see that rotation is essentially the only weakly equivalent transformation 
for correlation function. We can expand the class of operator having this property by combining theorem 
4 and theorem 3 with some technical conditions, then we have the following corollary. 

Corollary 1: If a second order operator whose Hessian is of the form as (10) and all ki have the same 
sign and < J^ki < oo, then rotation is essentially the only transformation which is both strongly and 
weakly equivalent. 

Proof: If the Hessian has the form above, then 



Thus it follows it is also a weakly equivalent transformation. 




N-l 



(25) 



n=0 



, Where 



1 X > 



sgn{x) = < 



-1 X < 
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Notice that the proof for correlation function follows here except we shall use Nl = { (iCi C2C2 • • • CaCa 
instead of Ni, where Q are all positive and 



If the length goes to infinity, Ni becomes dense, since < ^ fcj < cxd, then iV/ is in some non-degenerated 
bounded set, which is still dense and resides in the first quadrant. Then the argument above still valid. 
Thus rotation is essentially the only weakly equivalent for this kind of operator. By theorem 1, rotation 



C. Mapping Between ID Euclidean Spaces 

The case in which the transformation T : — > is Umited to n = 1 is particularly common in the 
literature. Moreover, this case stands out and deserves special attention since the only possible rotation 
on M is obtained by scaling and interchanging the mapping values. From Theorems 1 and 4, we observe 
that if the mappings cannot be obtained by scahng and interchanging the mapping values, the correlation 
and Fourier analysis results obtained using these mappings are neither strongly nor weakly equivalent. 
Therefore, in the ID case, the equivalent mapping class under a given operator becomes fairly limited. 
In general, distinct mappings will usually lead to inconsistent correlation and Fourier analysis results. 
Another interesting fact about ID mappings that can be derived from our previous results is that if the 
mapping is binary (i.e. the range of the mapping can only takes two distinct values), then we observe 
that the correlation and Fourier analysis under any two such binary mappings are always consistent since 
we can always obtain one of the mappings by scahng and interchanging the mapping values of the other 
mapping. 

D. Mapping Between Euclidean Spaces of Different Dimensions 

In the previous sections, we focused primarily on the transformation T : W^, where n = m. 

In this section, we will present a brief discussion of the case where m ^ n. If m > n, which means 
T will transform the vector into a larger dimensional Euclidean space. However, since there is a natural 
embedding for M" into W^, we can always think the transformation as T':W^ W^. For second-order 
operators which are shown equivalent under rotation, we still have the same results in this situation, 
except the rotation matrix here means a mattix have orthonormal columns. 
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For the case m <n, v/e can also think as is embedded inside MP by the transform 

(Imxm \ 
X (26) 
0(n— m)xm / 
^ ' ' nxm 

, where x G M"*,y G W^. Then we only need to research on the new transformation T':W^ —>■ M". 
However, in this case, we can see that we actually project the higher dimensional subspace into a lower 
dimensional space, rotation here in general is not an equivalent transformation anymore. Intuitively, 
because of the projection, we lose information projected on (n — m) dimensions. Therefore rotation is 
no longer an equivalent transformation. 

III. Abstract Mapping Equivalence for Symbolic Sequences 
A. Abstract Mapping Model and Examples 

In previous sections, we mainly focused on properties of mappings, which map symbols into vector 
space. However, it is not necessary to restrict to the vector space. Many classical concepts in numerical 
signal processing can be extended to various algebraic structures. For example, the Fourier transform 
and Wavelet transform can be defined on group, ring and finite-field [33], [34], [35]. In this section, we 
introduce the generahzed mapping to arbitrary semi-ring, ring or algebra structure. We shall also extend 
the notion of equivalence defined in previous section. 

For given finite alphabet A, we define F{A) as collection of all the symbolic sequences and define the 
binary operation as concatenating two symbolic sequences. It can be shown F{A) is a free semi-group 
[36]. R is any semi-ring. Let TZ to be collection of all maps from F{A) to R. For any / G 7^, we denote 
it as the formal series, 

f= J2 •^(^)^ (27) 

ueF{A) 

we define two operation -|- and • on 7?. as 

f + 9= E + (28) 

{f-9){s)=J2ifi^)9{v))s (29) 

uv=s 

With these two binary operations, we have the following proposition to show that we construct a new 
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algebraic structure on TZ 

Proposition 3: R is a semi-ring or ring, then {TZ, +, •) forms a semi-ring or ring respectively. 

Proof: By the definition of addition, we can see that if {R, +) is a commutative monoid or abelian 
group, then {TZ, +) has the same property correspondingly. Therefore it's enough to show to show {TZ, •) 
is a semi-group, i.e. we need to show the multipUcation is associative. 

yf,g,hen 

ifgMs) = J2 Ugi^mw) 

xw=s 

xw~s uv=x 

= {fiu)g{v))h{w) 

uvw=s 

= E fiu)i9{v)h{w)) 

u{vw)=s 

= f{9h){s) (30) 

Therefore the multiplication is associative. We proved the proposition. ■ 
The ring TZ is called as the semi-group ring of F{A) with coefficients in R. Furthermore, if i? is a 
left-i?' module for some ring R'. We can define for any r G R\ 

then TZ is the left-i?' algebra. The TZ may be interpreted as the generalized filter space, while the F{A) 
is as the signal space. The multiplication can been thought as the extension of discrete convolution. If 
we let R and R' to be R, ^ = {0, 1, 2, . . . }, the multipUcation degenerates to classical convolution. The 
symbol sequence is mapped into a numerical sequence. 

Another example of the abstract mapping model is the probability model. Consider all the outcomes 
of the words in F{A). Denote the outcome space as O. a; is a cr-algebra on O and P is a probabiUty 
measure on a. Notice that two set-operations on a, fl and U is analog of ■ and and O can be seen 
as and 1 respectively. Therefore R = (a;, U, fl) forms a semi-ring. The probability measure mapping 
P here is interpreted as a semi-ring mapping from TZ to the semi-group ring of F{A) with coefficients 
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in R, which is defined as, 



p{f)= E ^p^(/(^)> 



•u 



(32) 



ueFiA) 



The probabiMty operations then can be realized by algebraic operations on TZ and the corresponding 
probability measure values are obtained after the mapping P. 

B. Abstract Mapping Equivalence 

For generahzed mapping, the equivalence problem is still worth for investigating. However, in the 
situation, it becomes much more difficult than in a M-vector space. The R in general does not possess 
any meaningful ordering. Therefore the definition of equivalence turns out to be limited for specific 
application. Nevertheless, as we mentioned before, in most cases, it is reasonable to require the result to 
be similar in certain extent. From now on, we assume i? is a integral domain with unity 1. We introduce 
the following definition for abstract equivalence of a generalized mapping. 

Definition 4: For any f,geTZ,f and g are abstractly equivalent, if the ideals they generated are the 
same, i.e. (/) = (g). 

The next proposition shows the intuition and legitimacy of this definition. 

Proposition 4: (/) = (g) if and only if / = ug, where u has multiplicative inverse. 
Proof: If (/) = (^f), then / = uig and g = U2f for some ui,U2 G TZ. We have 

/ = UiU2f 



Since R is integral domain, we have uiU2 = 1. ui and U2 are units. 

Conversely, if / = ug, then g = u"^/. We have (/) C (g) and (g) C (/), therefore (/) = (g). ■ 
A loose interpretation of Proposition 4 implies that abstractly equivalent mappings only differ by a 
"scale" and that the scale change can be "reversed." Let us first consider the case where the semi-ring R 
is M or C. In this case, R forms a field and thus any non-zero element is a unit. It is easy to show that 
in this case strong equivalence implies abstract equivalence. To see that strong equivalence is a special 
case of abstract equivalence, let us consider mappings / and g to be defined at the origin "0" of the 
field (i.e. we ignore the translation between the mappings). If non-trivial mappings / and g are strongly 
equivalent, then / = eg, where c is a non-zero real or complex number. We observe that c is a unit 
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and therefore its inverse c exists. Finally, we note that the strongly equivalent mappings / and g are 
abstractly equivalent. 

We now extend the discussion to semi-ring R given by M" or C". We note that R forms a vector 
space over M or C. We recall that the orthogonal hnear operator is a necessary and sufficient condition 
for strong equivalence under the correlation function. Moreover, we note that the set of orthogonal linear 
operators forms an orthogonal group 0{n) given by 0(n) = {M e C'*'''* : M^M = /}, where I 
denotes the identity operator. The orthogonal group contains the special orthogonal group SO{n) which 
represents usual rotations and is given by SO{n) = {M € C"^" : M = I and det{M) = 1}. Finally, 
we observe that if / and g are strongly equivalent under the correlation function, then / = Mg, where 
M G 0{n). Since M is in the orthogonal group 0(n), we note that it is a unit (i.e. = G 0(n)). 
Therefore, we once again conclude that strong equivalence imphes abstract equivalence. 

IV. Applications and Examples in Genomic Signal Processing 

As we discussed in previous sections, approach of mapping the symbolic sequence to is a widely 
adopted method for symbolic signal process. Therefore the consistency problem for results using different 
mappings always arises. In this section, we will apply our theory to genomic signal processing. 




N 

(a) Correlation coefficients for 
strong equivalence 



1 p 

0.95 




(b) Percentage of points preserving local extremes 
for weak equivalence 



Figure 2. Two consistency measurements for correlation results using two mapping methods change with the growth of sequence 
length N for AD169 DNA sequence. 



We conduct experiments on Human gene AD169 sequence (GenBank accession no. X17403). We 
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calculate the correlation function as in (6) using two different mappings. The first one maps the v4 = 
{A, T, G, C} to the standard basis of correspondingly. Then we use another mapping strategy, which 
maps A to (-1, 0, 0, 0), T to (1, 0, 0, 0), G to (0, 1, 0, 0) and C to (0, -1,0, 0). These are two widely used 
mapping methods [11], [12]. In Fig. 2(a), we show the changing of correlation coefficient between the 
two correlation results with growth of DNA sequence length N and in (b) we show how the percentage 
of the points having same local extremum property in two results grows with N. The second mapping is 
not obtained by rotation of the first mapping. As a result, all these two metrics have a decreasing trend 
with the grown of length N. 

We also calculate the two metrics on rhodopsin gene sequence (GenBank accession no. U49742) for 
these two metrics and the result is shown in Fig. 3. 




50 100 150 200 250 

N 



(a) Correlation coefficients for 
strong equivalence 



0.85 




50 100 150 200 250 

N 

(b) Percentage of points preserving local extremes 
for weak equivalence 



Figure 3. Two consistency measurements for correlation results using two mapping methods change with the growth of sequence 
length N for rhodopsin gene sequence. 

These two examples show the same trends for two metrics between two mappings. The similarity 
between the two results become less and less, which finally may lead to an inconsistent analysis results 
due to the fact that two chosen mapping methods are not equivalent for the correlation function. Thus it 
does not make sense to make comparison between the analysis result for a given gene sequence under 
these two mapping methods. 

In Fig. 5, we show the two consistency measurements between the power spectrum under the previous 
two mapping methods. Figures 5(a) and 5(b) show the correlation coefficient and the percentage of the 
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200 250 300 



(a) Weak equivalence metric between results using (b) Weak equivalence metric between results using 
the first and the second mapping the first and the third mapping 

Figure 4. Percentage of points preserving local extremes for Fourier transform using three different maps changes with growth 
of sequence length N for human gene AD169 sequences. 



points having same local extremum property in two power spectrum results grow with the sequence length 
N for human gene AD 169 respectively, (c) and (d) show the strong and weak equivalence measurements 
changing with the length N respectively for rhodopsin gene sequences. Although the equivalent transform 
we analyzed before does not mainly focus on power spectrum, we can still find that the power spectrum 
results using these two different mappings have the trend to be inconsistent. Since the correlation and 
power spectrum are widely used and pervasive in statistic analysis, it suggests the consistency problem 
should not be neglected when comparing analysis results. 

Research on statistical properties of coding and non-coding regions in nucleotide sequences is an 
important topic in genomic signal processing [27], [28]. We shall also conduct experiments on coding 
and non-coding regions of Human gene TXNDC9 and NOC2L using the two mapping methods introduced 
earlier. As shown in Fig. 6, for both genes, the consistency measure between the correlation functions 
decays as the length N increases. As a result of our analysis, we note that the correlation results under the 
two mappings are inconsistent in the long run. Furthermore, any comparison between the analysis results 
obtained by relying on these correlation functions becomes increasingly unreUable. Another interesting 
result can be observed in Fig. 6, where the decay rate of the non-coding region is faster than the coding 
region. This phenomenon could be attributed to the fact that the coding regions can be viewed as more 
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(a) Correlation coefficients for (b) Percentage of points preserving local extremes 

strong equivalence for weak equivalence 
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(c) Correlation coefficients for (d) Percentage of points preserving local extremes 

strong equivalence for weak equivalence 

Figure 5. (a), (b) shows the correlation coefficient and percentage of points preserving local extremes of the power spectrum 
change with growth of sequence length A'' for Human gene AD 169 sequences respectively using mapping which maps A = 
{A, T, G, C} to the standard basis of K*. (c) and (d) shows same two consistency measurements respectively change with 
growth of sequence length N for rhodopsin gene sequences using the map which maps A to (—1, 0, 0, 0), T to (1, 0, 0, 0), G to 
(0, 1, 0, 0) and C to (0, -1, 0, 0). 



random than the non-coding regions. Nevertheless, the main conclusion that we draw our attention to is 
that the consistency of the correlation between non-equivalent mappings decays as the sequence length 

increases for both coding and non-coding regions. 

We also calculate the Fourier transform as defined in (11) on Human gene AD169 sequence. The 
first mapping is chosen as before, which maps the .4. = {A, T, G, C} to the standard basis of 
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(a) Correlation coefficients for 
strong equivalence of coding 
and non-coding regions of TXNDC9 




Non coding 
- G - Coding 



150 
N 



(b) Correlation coefficients for 
strong equivalence of coding 
and non-coding regions of N0C2L 



Figure 6. Comparison the consistency measure of correlation functions clianging witli growtli of N for coding and non-coding 
regions of Human gene TXNDC9 and NOC2L. 



respectively. Then we use second mapping strategy, which maps A to (0.9912,0.1322,0,0), T to 
(0.8367,-0.239,0.1195,0.4781), G to (-0.7505,-0.5361,-0.2144,0.3216) and C to 
(0.7804, -0.5103, -0.2401, -0.2701). The third strategy maps A to (^,0, ^,0), T to (0, ^,0, ^), G 
to (""^1 0' 0) ^i^d '-'to ('-*)~"^iO)"^)- We have normalized the mappings so that it will not change 
the energy of the result. The second mapping is not obtained by rotating the first mapping. While the 
third mapping is obtained by rotating the first mapping. In Fig. 4, we show the weak equivalence metrics 
for Fourier analysis results. Figure 4(a) shows consistency between 1 and 2 results becoming less and 
less. While (b) suggests a completely consistent results. In Fig. 7, we show the analysis result of the three 
mappings. As we showed before, (a) and (c) are exactly same, since rotation is a strongly equivalent 
transformation. We can find many differences between (a) and (b), especially at the peaks. We also 
calculate the correlation coefficient between (a) and (b), which is 0.82. The peak here means the repeat 
pattern of some periodic sequences, however, since we have shown that the mapping is not equivalent 
here, it makes no reason to debate on possible conflicting analysis results for this gene sequence. 

In Fig. 8, we illustrate consistency measures for the mapping T : W^, where m ^ n. The first 

mapping we used is the standard Voss mapping introduced earlier. The second mapping we employed is 
the RY rule [9], which maps A, G to 1 and T, C to —1. In this case, the transformation T : — is not 
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(a) Fourier transform under the first mapping 
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(b) Fourier transform under the second mapping (c) Fourier transform under the third mapping 



Figure 7. Fourier transform using three mapping methods. 



an equivalent transformation as discussed in Section II-D. From Fig. 8, we observe that the consistency 
between the mappings decays as sequence length N increases. 

In all of the experiments conducted we observe that rotation serves as the unique equivalent transforma- 
tion for the correlation function. Rotation also provides a strongly equivalent transformation for Fourier 
and spectrum analysis. Mappings which are not equivalent lead to inconsistent results as the sequence 
length N increases. However, we must point out that the opposite may not be true: specifically, for a 
fixed-length sequence, the consistency for any two mappings does not necessarily decay as the difference 
between these two mappings increases, measured in the sense of rotation equivalence, i.e. the similarity 
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(a) Correlation coefficients for (b) Percentage of points preserving local extremes 

strong equivalence for weak equivalence 

Figure 8. (a), (b) shows the correlation coefficient and percentage of points preserving local extremes of the power spectrum 
change with growth of sequence length A'^ for Human gene AD 169 sequences respectively using mapping which maps A = 
{A, T, G, C} to the standard basis of R** and the RY rule 



between the first mapping and any mapping obtained by rotating the second mapping. 

V. Conclusion 

In this paper, we presented a novel framework for analysis of the equivalence of distinct numerical 
mappings of symbohc sequences undergoing a transformation by an operator. We introduced a strong 
equivalence property that demands perfect correlation between the transformations of distinct numerical 
representations. We also characterized the weak equivalence property which requires the preservation of 
the extrema in the transformation of the numerical representations. We studied the mapping equivalence 
theory for general operators by using Taylor's approximation. Moreover, we focused on first- and second- 
order operators such as the correlation function and Fourier transform. Furthermore, we derived the largest 
class of equivalent mappings which lead to consistent results when undergoing transformation by a class 
of operators. We demonstrated that rotation plays an important role in characterization of equivalence 
between distinct mappings. We subsequently derived a class of operators which is equivalent under 
rotations. We also introduced an abstract mapping model and extended the notion of equivalence to a 
more general algebraic structure. We presented simulations of the mathematical and statistical properties 
of genomic sequences in order to demonstrate the imphcations of the proposed mapping equivalence 
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theory. Our results suggest that one of the reasons for inconsistency in the analysis of genomic data 
reported in the theoretical biology literature as well as many other related areas can be attributed to 
incompatibility of the numerical representation of symbolic data. For instance, we have shown that some 
of the mappings used for the representation of genomic data are incompatible and could have led to the 
contradictory conclusions reached in the analysis of long-range correlations of DNA sequences. 



The authors would like to thank Prof. Alan Willsky from the Massachusetts Institute of Technology 
for discussions that served as the impetus for the work presented in this paper. 



Proof of Theorem 3: 

Notice that M" with inner product < x, y >= x^y is a Hilbert space. So for any linear bounded 
operator, 3y S W\ such that ^i{x) = x'^y. So ^i{T{x)) = T^x^y. T is strongly equivalent, therefore 
^l{T{x)) = \^i{x)+c for some A and c. Then we have T = XInuxNu^ i-e. T is a trivial scaled identity 
transform. This finishes the proof of the first claim. We claim if a non-trivial operator whose 
Taylor's expansion has no terms of order higher than or equal to three has a non-trivial Unear strongly 
equivalent transformation, then it must only have the second order term and the constant term. We can 
always scale or add constant for the transformation to get a strongly equivalent result after transformation. 
So without loss of generality, we assume the result after the transformation is exactly the same as the 
previous one, i.e. If ^i{x) = ^x^V^*/(0)x and */(r(a:;)) = then we have 



this equality holds for any x G ]R^"^i. Therefore we have r^(V2*;(0))r = T is a rotation. 
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Appendix 



^x^v2*^(o)x = ^x^r^(v2*;(o))rx 



(34) 



I.e. 



/ 



■nxn 



n 



(35) 



T = 



\ 




NnxNn 
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Since R-^R = /, we have 



(36) 



Because R^R = RR^ = /, T^T = TT^, therefore T is normal, T is unitarily diagonalizable [37]. 
R is also normal. Therefore 3V unitary, such that R = V^A'V, where A' is a diagonal matrix. Since R 
is real orthogonal, the eigenvalues of R are on the unit sphere S^. Without loss of generality, we assume 
R has two eigenvalues, 1 and /x. Let the algebraic multiplicity of 1 be i, then the algebraic multipUcity 
of fi is n — i. So we have 

(37) 



LetU 



( 



V 



V 



\ 



V 



A' 



,A 



A' 



M-^(n— i) X [n—i) 



From (36), we have {V'^i^i{Qi))U" kU = U^AUV^^i{0). 



Therefore we have 



(38) 



Let X = U{V^^i{0))U". We have XA = AX. By using the Jordan canonical form [38, Chapter 
VIII], we have that all X which commutes with A must have the form as follow: 



X 



An 





A12 





Aim 








B21 





-B22 





B2M 


A' 

^12 





^31 





A3M 








-^22 





-B41 





BiM 


A' 





A' 





A(2N-l)l 








B2M 





B'am ' 





B{2N) 



\ 



(39) 



V 

Every non-zero submatrix in X is an arbitrary upper triangular submatrix which has identical diagonal 
entries. All submatrices A^i and A'^j^ have size i x i, all B^i and B'^.^ have size (n — i) x (n — i). Thus 
all V^*/(0) satisfies (36) are of the form V^*/(0) = U^XU. However, since the */(x) is analytic, the 
Hessian must be symmetric. Therefore all the submatrices of X, A^i and B^i have the form kkil and 
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A'j^i and B'l^^ have also the form kkil. Notice that this is for a given rotation. Since (36) holds for any 
given rotation, we have that 

v^*/(o) G y = f] {u^xu} (40) 

U U nitary 

The rotation is arbitrary, (37) should hold for any i = 0, . . . ,n. Claim that the x principal matrice 

A' 



and 



in X must satisfy: 



— kijinxn 



(41) 



A' 



B' 



h' T 



where i = 1, 3, 5, . . . , (2A'' — 1) and j = 1, . . . M. Because we know that 



kijiixi 



B, 



^{i+'i-)j-^in-i)x{n—i) 



(42) 



(43) 



but if kij + A;(j+i)j, then this imphes {U^XU} {\{U^X'U} = 0, where (37) for X' is of the form 



A' 



'(i+l)x(i+l) 



(44) 



A'-^(n-i-l)x(n-i-l) 

Since we know Y is not empty, we get a contradiction here. Therefore (41) and (42) hold. If we choose 

U = I, we have Y C X, where for X, (41) and (42) hold. It's straightforward to check that such X is 
commutable with T. Therefore Y = X. Finally we show that 



k2llnxn k22lnxn ' ' ' k2Nlnxn 



\ 



y kNlInxn k]S[2lnxn ' ' ' ki\fisflnxn j 



(45) 
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where kij G M and kij = kji, Vz 7^ j. If we expand at any other point xq, then 

^{x-xofV^^i{xo){x-xo) = ^x'^V^^iixo)x-^x'^V^^i{xo)x-^x'^V^^i{xQ)xo+^xoV^^i{xo)xo 

(46) 

The only second order term is ^x^V^^i{xo)x. Repeat the previous argument, we will have (10). ■ 
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